Context-dependent outcome encoding in human reinforcement learning
نویسندگان
چکیده
A wealth of evidence in perceptual and economic decision-making research suggests that the subjective assessment one option is influenced by context. series studies provides same coding principles apply to situations where decisions are shaped past outcomes, is, reinforcement-learning situations. In bandit tasks, human behavior explained models assuming individuals do not learn objective value an outcome, but rather its subjective, context-dependent representation. We argue that, while such outcome context-dependence may be informationally or ecologically optimal, it concomitantly undermines capacity generalize value-based knowledge new contexts — sometimes creating apparent decision paradoxes.
منابع مشابه
Context-outcome associations underlie context-switch effects after partial reinforcement in human predictive learning
Predictive value for continuously reinforced cues is affected by context changes when they are trained within a context in which a different cue undergoes partial reinforcement. An experiment was conducted with the goal of exploring the mechanisms underlying this context-switch effect. Human participants were trained in a predictive learning situation in which a cue received partial reinforceme...
متن کاملContext-dependent stopper encoding
Abstra t. A hara ter-based en oding method is presented for naturallanguage texts and geneti data. Exa t string mat hing from the en oded text is faster than from the original text, with medium and longer patterns. A ompression ratio of about 50% is a hieved as a by-produ t. The method en odes hara ters with variable-length odewords of 2-bit base symbols. An advan ed variant is ontext-dependent...
متن کاملReinforcement Learning through Neural Encoding
Recent progress in the field of Reinforcement Learning (RL) has enabled to tackle bigger and more challenging tasks. However, the increasing complexity of the problems, as well as the use of more sophisticated models such as Deep Neural Networks (DNN), has impeded the ability to understand the behavior of trained policies. In this work, we present the Semi-Aggregated Markov Decision Process (SA...
متن کاملContext Tree Maximizing Reinforcement Learning
• Stochastic search approach [Nguyen et al 2011]: Parallel Tempering is utilized to find a good map based on the ΦMDP cost function. However, this approach is costly and does not guarantee finding the optimal state set or even a good one given a history •We propose an analytical and linear-time solution to this problem based on Context Tree Maximizing ∗Research School of Computer Science, Colle...
متن کاملLinear Feature Encoding for Reinforcement Learning
Feature construction is of vital importance in reinforcement learning, as the quality of a value function or policy is largely determined by the corresponding features. The recent successes of deep reinforcement learning (RL) only increase the importance of understanding feature construction. Typical deep RL approaches use a linear output layer, which means that deep RL can be interpreted as a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Current opinion in behavioral sciences
سال: 2021
ISSN: ['2352-1554', '2352-1546']
DOI: https://doi.org/10.1016/j.cobeha.2021.06.006